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SUMMARY 

Exon trapping allows for the rapid identification and cloning of coding regions from cloned eukaryotk DNA- In 
prehmmary Experiments, we observed two phenomena which limited the exon-trappmg effiaency of pSPU-based 
s terns tL first factor ihat affected performance was revealed when we found that up to 50% of the 
Ins contained sequences derived from the intron of the P SPL3 trapping vector. we lustra 
responsible for the cryptic splice event from the original splicing vector resulted in a new vector, pSPL3B. We <^°"""* 
uXsPL3B virtual^elimfnates pSPL3-only spliced products while maximizing the proportion of exon traps contammg 
Gnomic DNA (>98%). The other step which impacted performance was our observation that a majority of the 
SdL^tant (Ap") clones produced after shotgun subcloning from Ap" ^^^^^ 
pSPL3-deficient, regularized cosmid vector fragments. Replacement of the pSPL3 Ap gene with the Cm cassette 
SLg chloramphenicol (Cm) acetyltransferas* enabled selection for only 1*^^^^^^ 
30-40-fold increase in the initial subcloning efficiency of cosmid-denved fragments with pSPL3-CAM when compa ed 
to pSPL3. The collective vector alterations described improve the overall exon-trapping efficiency of the pSPL3-based 
tripping system. 



INTRODUCTION 

The method of exon trapping has been recently 
described for the rapid and efficient isolation of coding 
regions from genomic DNA (Auch and Reth, 1990; Duyk 
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et aI M 1990; Buckler et al., 1991: Church et al., 1994). One 
of the more widely used exon-trapping procedure utilizes 
the pSPL3 plasmid which contains rabbit p-globin 
coding sequences separated by a portion of the HIV-rat 
gene (Buckler et al., 1991: Church et al., 1993; 1994). 
Genomic DNA ( YAC, cosmid or PI clones) is cloned into 
the intron of the tat gene and.the resulting subclones are 
transfected into COS-7 cells. SV40 sequences in the vector 
allow for both replication of the pSPL3 containing DNAs 
and transcription of the cloned genomic DNAs. Exons 
within the subcloned genomic DNAs that are spliced into 
the globin-rat transcript are recovered by RT-PCR. A 
major advantage of exon trapping is that the expression 
of the subcloned DNA is directed by a viral promoter, 
thus developmental and tissue specific expression of genes 

V^u^ayeinitiatea^udies to assemble a transcriptional 
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map for a 750-kb CpG-rich region of human chromo- 
some 16pl3.3 using exon trapping. This region of chro- 
mosome 16 is of particular interest since it contains the 
gene responsible for greater than 90% of all cases of 
autosomal dominant polycystic disease (APKD; Reeders 
et al., 1985; Germino et al., 1992; Somlo et al., 1992; The 
European Polycystic Kidney Disease Consortium, 1994). 
It also contains the disease gene for tuberous sclerosis 
(TSC2; The European Chromosome 16 Tuberous 
Sclerosis Consortium, 1993) and possibly the gene 
responsible for familial Mediterranean fever (MEF; 
Aksentijevich et aL, 1993). We have proceeded to use 
exon trapping to identify coding sequences from the 
PKD-1 locus. In our initial experiments, we observed that 
a significant percentage of the trapped products con- 
tained a pSPL3-derived cryptic exon from within the 
HlV-tat intron. We also observed a poor subcloning 
efficiency in the initial shotgun subcloning of genomic 
cosmid clones into the pSPL3 vector, with the majority 
of the ligation products being rccircularized cosmid 
vector. We report the identification of a cryptic exon 
within the pSPL3 vector and demonstrate the utility and 
efficiency of a modified pSPL3 vector lacking the cryptic 
splice sites. We also demonstrate the utility of a modified 
pSPL3 vector in which the Ap R gene has been replaced 
by the Cm* gene. 
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EXPERIMENTAL AND DISCUSSION 

(a) The pSPL3 splicing vector contains a cryptic exon 

Exon trapping, as described by Buckler et al. (1991) 
and Church et aL (1993; 1994), identifies secondary PCR 
products larger than the 177-bp product generated by 
the pSPL3 vector alone. In our initial studies, we found 
that up to 50% of our putative trapped exons were of a 
single size (approx. 120-bp longer in size than the 177-bp 
pSPL3 product) and were not derived from the parental 
PI or cosmid inserts. The prevalence of the unexplained 
exon was independent of the initial subcloning efficiency 
and solely dependent on pSPL3, which alone yielded the 
cryptic exon in up to 50% of the cases (Fig. 1, lane 2). 
Characterization of the cryptic splice product indicated 
that it contained 116 nt of sequence from the intronic 
portion of pSPL3 (Fig. 2). Sequences 5' of the cryptic 
exon were found to contain a pyrimidine-rich tract and 
a 5' splice acceptor site with some homology to the pub- 
lished consensus sequence (Mount, 1982), while the 3' 
splice donor site of the cryptic exon contained sequences 
which are nearly identical to the published consensus 
sequence (Mount, 1982). 

To increase the efficiency and reliability of exon trap- 
ping, we removed the cryptic exon using convenient 



Fig. 1. Comparison of the pSPU and pSPL3B trapping vectors. 
(A) EtdBr-stained get of PCR products from an exon-trappttig experiment 
using vector without insert (Nl) or vector into which genomic DNA 
from the 96.4B bacteriophage PI clone had been shotgun subdoned 
(96 43) A PCR reaction in which no template was added (No DNA) 
and a 100-bp size ladder are also shown. The 177-bp product in the 
PSPL3B Nl (no insert) lane is the globin-tar RT-PCR product without 
an internal trapped exon. (B) The same gel was blotted to nylon mem- 
brane and hybridized with the HIV-raf intron. Methods: The 
bacteriophage PI clone was cut to completion with BamHI + Bglil ano 
shotgun subclooed into dephosphorylated BomHI-cut pSPU or 
PSPL3B. After overnight ligation at I6°Q the reactions were trans- 
formed into competent £. coll HB101. The following day, colonics were . 
scraped ofif those plates having the highest transformation efficiency 
(based on comparison to 'no insert' control ligations). Alkaline ysw 
minipreps were performed on the plate scrape pools and the resulting 
miniprep DNA was electroporated into COS-7 as described (Churcn 
et al 1994). At about 48 h post^electroporation, RT-PCR was per- 
formed as described by the manufacturer (Life Technology 
Gaithersburg, MD, USA). The resulting secondary PCR products were 
separated on a 1.5% agarose gel and blotted to nylon membrane. 1^ 
HtV-wt intron was amplified using PCR with primers at nt ft/ 1 
691 (5-TGCACGCTCTAGAGTCG) and 3102-3116 (5-ATACCCO 
TCGGGAGATC) of pSPL3. The resulting PCR product was g»- 
purified and labeled by random priming (Feinberg and Vogelstan. 
1983). The blot was hybridized at 65*C in 0.25 M NaphosphaK 
pH 7.2/7% SDS, and washed in 0.2 x SSC/0-l%SDS at 65'C for 30 m«v 
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SFig. 1 Removal of the crytpic exon from the pSPL3 splicing vector. 
f(A) The nt sequence for the cryptic exon's 5' splice acceptor and 3' 
pspJice donor Bites. The numbers above the sequence indicate the position 
|bf the ht in the pSPL3 vector. The cryptic exon is indicated by the 
Efsbaded box. Sequences with homology to the consensus splice sites 
jf(shown below pSPL3 sequence) are underlined. (B) Schematic diagram 
Of the pSPL3 splicing vector. The SV40 promoter is indicated by an 
arrow. Open boxes indicate the globin coding sequences while the black 
boxes represent the HlV-tat splice donor and acceptor sequences. The 
solid black line designates the HlV-tar intronic sequences including the 
cryptic exon (hatched box). The position of the MCS, DralU (D) nnd 
Stul (S) sites are also shown. (C) The splicing reactions which lead to 
tile wild-type (wt) and cryptic (C,and C 2 ) splice products are indicated. 
{D) Schematic diagram of the pSPL3B vector which lacks the cryptic 
'exon. The cryptic exon from pSPL3 was removed by digesting pSPL3 
(LifeTechnologies) with Dralll + Stul. A 1-kb Stul-Dralll fragment con- 
taining nt 321 to 13U of pSPL3 was gel-purified, and the Drain end 
>as blunt-ended using Pollk. The resulting blunt-ended fragment was 
subcloned into pSPL3 which had been digested with Stul to remove nt 
321 to 154%. The resulting vector, pSPL3B, is identical to pSPL3 except 
|for the absence of nt 131 1 through 1548. Deletion of the cryptic exon 
■*as verified by nt sequence analysis. 

restriction enzyme sites to generate pSPL3B (as described 
in the legend to Fig. 2). A side by side comparison of 
PSPL3 and pSPL3B was performed to demonstrate the 
tility of the modified vector. DNA from the genomic P J 



clone 96.4B was shotgun subcloned into either pSPL3 or 
pSPL3B. The subcloning efficiency of the two reactions 
was equivalent based on transformation efficiencies and 
restriction digestion analysis (data not shown). Exon 
trapping was performed in parallel and the secondary 
PCR products were separated on an agarose gel and 
Southern blotted prior to subcloning. The removal of the 
cryptic exon did not decrease our ability to trap exons 
as judged by the number and intensity of bands on an 
EtdBr-stained agarose gel (Fig. 1A); however, hybridiza- 
tion with the HIV-tat intron from pSPL3 indicated that 
cryptic products were present in the pSPL3 reactions but 
not in the pSPL3B reactions (Fig. IB). We also observed 
hybridization of the HIV-taf intron probe to a second 
less abundant product at 375 bp (Fig. IB). The second 
product appears to result from the cryptic exon's use of 
a minor splice acceptor or donor site, since it is" also 
eliminated by the 238-bp deletion in pSPL3B. These data 
indicate that the removal of the cryptic exon significantly 
reduces the levels of non-specific products in the 
secondary PCR reactions. 

(b) Removal of the cryptic exon reduces the percentage of 
subcloned products containing HIV-ta* sequences 

To demonstrate that removal of the cryptic splice pro- 
ducts decreases the percentage of non-specific products 
in the final cloning step, we hybridized the pSPL3 intron 
to colony lifts from a number of exon-trapping experi- 
ments. Exon trapping was performed on three PI clones 
and colony hybridizations were performed on the result- 
ing subcloned exons using the HIV-faf intron as a probe. 
The results from this comparison are summarized in 
Table I. With the original pSPL3 vector, 5-50% of all 
products obtained by exon trapping contained pSPL3 
intronic sequences, while 0-2% contained cryptic pro- 
ducts for pSPL3B. When only the experiments using PI 
clones were considered, 22% of all the traps from the 
pSPL3 experiment were HIV-far derived, while less than 
1% were HIV-tat derived in the pSPL3B experiments. 
The products hybridizing to the HIV-raf intron in the 
pSPL3B experiments presumably represent a less effi- 
ciently spliced cryptic exon(s) which is still present in 
the vector 

(c) Replacement of the pSPL3 Ap R gene with the Cm* 
gene significantly increases shotgun subcloning efficiency 
of cosmid clones 

A pSPL3 derivative containing the Cm R gene instead 
of the Ap R gene has also been produced. As mentioned 
above, recircularized cosmid vector significantly reduced 
the overall efficiency of subcloning into the pSPL3 vector. 
Despite optimization of ligation conditions, as many as 
95% of all Ap R colonies were found to be cosmid derived 
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Clone 


Cryptic exons* 






pSPL3 


pSPL3B 


No insert (Control) 

96.4B 

97. IOC 
109.8C 


203/400(51%) 
10/185(5%) 
87/190 (46%) 
39/246(16%) 


1/220 (0.4%) 
0/286 (0%) 
0/104(0%) 
4/183 (2%) . 


Overall (excluding control) 


136/621 (22%) 


4/573 (<1%) 



* Numerators indicate the number of colonies hybridizing to the HIV- 
tat intron, while denominators indicate the total number of colonics 
from each plating. The percentage of HiV-wt-positive colonies is indi- 
cated in parentheses. The overall number and percentage of HIV-(af 
positive products from the three PI experiments is shown in the 
bottom line. . 
Methods: Exon trapping was performed with three genomic PI clones 
from human chromosome 16 (96.4B. 97.10G and 109.8C) and the corre- 
sponding no insert controls using either pSPL3 or pSPL3B as described 
in the legend for Fig. 1. The resulting PCR products were subcloned 
into pAMPIO (LifcTechnologies) using the UDG cloning method as 
described by the manufacturer (LifcTechnologies) followed by 
transformation into competent E. coli DH5a cells. Colony lifts were 
performed and the resulting filters were hybridized at 65°C m 0.25 !vt 
Na-phosphate pH 7.2/7% SDS, using the HlV-far intron as a 
probe (described in the legend to Fig. 1): Filters were washed in 
0.2 x SSC/0.1 % SDS at 65°C for 30 min. 

(data not shown). A simple solution to this problem 
would be to utilize restriction enzymes whose sites are 
present in the MCS of pSPL3 and cut within the Ap* 
gene of cosmid vectors, or between the ori and the Ap R 
gene in the cosmid. However, Pstl is the only enzyme in 
the pSPL3 MCS which cuts between the ori and the Ap R 
gene in supercos or pWE cosmid vectors. To eliminate 
cosmid vector contamination and maximize the selection 
of usable sites in the pSPL3 MCS, the Ap R gene of pSPL3 
was replaced with the Cm R gene as described in the legend 
to Table II. To examine the efficiency of this pSPL3 deriv- 
ative, two cosmids were shotgun subcloned into 
pSPL3-CAM and the ligation reactions were plated on 
LB + Ap and LB + Cm plates. The results from these 
experiments are summarized in Table II. The cloning 
efficiencies of the cosmid-derived plasmids, based on the 
transformation efficiency on Ap plates, was 36-43-fold 
higher than that of pSPL3-CAM on Cm containing 
plates, confirming that the majority of ligation products 
are indeed cosmid derived. Colony hybridizations were 
performed on the resulting colonies using cosmid- or 
pSPL3-specific oligo probes. On filter lifts from the 
LB + Cm plates, every colony was seen to hybridize with 
oligo probes specific for the trapping vector, whereas less 
than 10% (3/39 and 5/47 for LA2H2 and LA325A11, 
respectively) of the colonies was seen to hybridize with 
the cosmid specific oligo (data not shown). Conversely, 
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TABLE H 

Plating efficiencies from ligation reactions of pSPL3-CAM 



Cosmid 


Suhcloning efficiency 


Ap'/Cm" 
ratio 






Cm* b 




pSPL3-CAM (no insert) 

LA2H2 

LA325A11 


6.S x 10 7 
9.5 x I0 7 


1.8x10* 
2.2 x 10 6 


36:1 
43:1 



- i ransiormauuii tim-iwnviw — ■> ~ ~ 

pSPL3-CAM when plated on 50 ug Ap/mL 

* Transformation efficiencies based on colony-forming units per ug of 
pSPL3-CAM when plated on 25 ug Cm/ml. 

Methods' The pSPL3-CAM vector was engineered by removing a L6-lcb 
Bam HI fragment from pUC18CMR (ATCC No. 37718; Schweizer, 
1990), which was then partially end-filled to leave a G overhang using 
Pollk' The resulting 1.6-kb Cm 11 fragment was ligated with a 5023-bp 
Bspm fragment from pSPL3, which had been partially end-filled to 
leave a C overhang. BspHl digestion of pSPL3 cleanly excises the Ap* 
cassette from the trapping vector. Cosmids LA325A1 1 and LA2H2 (con- 
taining genomic DNA from human chromosome 16) were digested with 
BamHl+Bgll and shotgun subcloned into BomHl-cut and dephos- 
phorylated pSPL3-CAM. The ligations were transformed into DH5a 
cells and plated onto LB agar plates containing 50 ug Ap/ml or 25 ug 
Cm/ml. Colonies were counted the following day. No colonies were 
seen on *no insert' plates. Transformation efficiencies represent the 
average of two platings. 



on filter lifts from the LB + Ap plates, more than 90% of 
the colonies from both reactions hybridized to a cosmid; 
specific probe (data not shown). Based on this analysis," 
if pSPL3 had been used instead of pSPL3-CAM, only, 
2-3% of the resulting colonies would contain a trappings 
vector-derived subclone. These data clearly demonstrate! 
that replacement of Ap R gene with the Cm R gene produces; 
a dramatic improvement in subcloning efficiency into the : 
splicing vector. 1 

(d) Conclusions and discussion 

(7 ) In our initial studies using exon trapping, the maj r- 
ity of the trapped products were found to contain cryptic 
elements from the HlV~tat intron of the pSPL3 splicing 
vector. In this report, we have demonstrated that removal 
of a cryptic exon within the HIV-tar intron of pSPL3 
significantly improves the efficiency of exon trapping. Foi 
all genomic PI clones examined, the number of cryptic 
products was dramatically reduced when the modified 
trapping vector was compared to the original pSPLJ 
vector. In studies which are currently ongoing, we have 
utilized the pSPL3B vector to scan approximately 750 B 
of genomic DNA for exons. In less than 1% of the case: 
examined did the trapped products hybridize to pSPL3l 
and not to the parental genomic PI clone (data no 

shown). . 
(2) We have also demonstrated that replacement of U* 
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A P * gene in pSPL3 with the Cm' gene increases the initial 
subcloning efficiency of genomic cosmid clones by 36-43 
told The presence of the Cm" gene eliminates recircular- 
• ized cosmid vector from the transformation products and 
insures that all the DNA used to transform the COS-7 
cells is P SPL3 derived. As the method was originally 
described with an Ap R splicing vector, as little as 2-3% 
of the DNA electroporated into COS cells would be 
oSPL3 recombinants. This may be sufficient for obtaining 
trapped exons; however, an increased likelihood of 
: obtaining trapped exons can be achieved by increasing 
'the initial subcloning efficiency by 36-43-fold with the 

> «SPL3-CAM vector. We have recently combined both 
improvements to the pSPL3 vector by adding the Crn" 
1 gene to the pSPL3B vector to produce pSPL3B-CAM. 

> (3) Church et al. (1994) found that approx. 4% of their 
f trapped products contained HIV-raf sequences from the 
! pSPL3 vector. However, using the P SPL3 exon-trapping 
hector as originally described, we found that on average, 
?20% of all trapped material was HIV-tor derived. In 

experiments using pools of plasmid minipreparations in 
I which all the pSPL3 clones contained subcloned genomic 
I DN as HIV-tar products represented up to 50% of the 
I trapped exons examined (data not shown), suggesting 
{that the HIV-rat products are independent of the initial 
[subcloning efficiency and are not due to non-recombinant 
lpSPL3 molecules being introduced into the COS cells. 
[The level of HIV-tor-derived products was found to differ 
iwith each PI or cosmid clone examined and may repre- 
Isent relative splicing efficiencies between the cryptic exon 
Tin pSPL3 and exons in the genomic subclones. The 
fdiflerence in abundance of the cryptic vector-derived 
[exon reported by Church et al. (1994) may also be a 
[consequence of clone-dependent variations in the relative 
[splicing efficiency of P SPL3 subcloned DNA. 
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